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Roy Timo and Michele Wigger 


Abstract — We consider a cache-aided communications system 
in which a transmitter communicates with many receivers over 
an erasure broadcast channel. The system serves as a basic 
model for communicating on-demand content during periods 
of high network congestion, where some content can be pre¬ 
placed in local caches near the receivers. We formulate the cache- 
aided communications problem as a joint cache-channel coding 
problem, and characterise some information-theoretic tradeoffs 
between reliable communications rates and cache sizes. We 
show that if the receivers experience different channel qualities, 
then using unequal cache sizes and joint cache-channel coding 
improves system efficiency. 

I. Introduction 

Consider a network with one transmitter and many receivers. 
Imagine that the transmitter has a library of messages (or, data 
files), and suppose that each receiver will request and down¬ 
load a message during a period of high network congestion. 
In such settings, it is advantageous to move traffic away from 
the congested period using caching. The basic idea of caching 
is that the transmitter sends and stores “parts” of the library 
in local cache memories near the receivers beforehand, during 
periods with low network traffic. The caches provide this data 
directly to the receivers, so that less data needs to be sent 
during the congested period. 

The above problem is relevant to video-streaming services, 
where content providers pre-place data in clients’ caches (or, 
on servers near the clients), with the goal of improving latency 
and rate performance in high demand periods. The content 
provider typically does not know in advance which specific 
movies the clients will request, and thus the cached data cannot 
depend on the clients’ specific demands. 

Let us call the pre-placement of data in caches the caching 
phase, and the remaining communications phase the delivery 
phase. Cache memories are typically much smaller than the 
library, and the caching phase occurs before the receivers 
demands are known. A key engineering challenge is, therefore, 
to carefully choose and cache only that data which is most 
useful during the delivery phase. That is, one should cache 
data that minimises the rate needed to complete the delivery- 
phase downloads for any feasible receiver demands. 

Cache-aided communications systems have received signif¬ 
icant attention in the information-theoretic literature in recent 
years, and those works most closely related this paper are ifTl- 
HD. With the exception of HD, these works assume that the 
delivery phase takes place over a single rate-limited multicast 
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noiseless channel (a bit-pipe) that connects the transmitter 
to every receiver. In practice, however, the communications 
medium is sometimes better modelled by a noisy broadcast 
channel (BC). This scenario is considered in HD, where the 
BC is essentially a set of parallel links with different qualities 
to the various receivers, which models a wireless fading BC. 

This paper takes a similar approach to that of llTTI . and we 
assume that the delivery phase takes place over a memoryless 
erasure BC. However, in contrast to HD, we assume that the 
caching phase takes place over error-free pipes. The motivation 
for this simplified assumption is that the caching phase typi¬ 
cally occurs during periods of low network-congestion, where 
network resources are not a limiting factor. 

Our main contribution in this paper is a joint cache-channel 
coding scheme for the described setup for general demands, 
and a characterization of the capacity-memory region when 
the receivers wish to learn the same message. Our results 
show that when the receivers experience different erasure 
probabilities (different channel qualities), then 

• it is beneficial to employ unequal cache sizes at the 
receivers (larger cache memories at weaker receivers, and 
smaller cache memories at strong receivers); and 

• joint cache-channel coding techniques can provide sig¬ 
nificant gains over separated cache and channel coding. 

Allocating larger cache memories to the weaker receivers 
is quite natural because one then needs to communicate 
less data over noisier channels (see also HD). Interestingly, 
there is an additional benefit to asymmetric caches that arises 
when joint cache-channel coding is used during the delivery 
phase. The basic idea is as follows; Consider a degraded 
BC communications scenario (such as the erasure BC) with 
separate cache and channel coding. Here a stronger receiver 
can decode all the data that is sent to a weaker receiver 
during the delivery phase. In fact, the strong receiver could 
decode even more data, but it is limited by the weaker 
receiver. Now suppose that part of the message intended for the 
stronger receiver is stored within the weaker receiver’s cache: 
one can freely piggyback this part of the stronger receiver’s 
message on the message intended for the weaker receiver. 
The weaker receiver is not penalised because it knows what 
data is being piggybacked on its desired message, and its 
channel decoder can still resolve its desired message. While, 
simultaneously, the stronger receiver has decoded something 
about its desired message and therefore we have improved 
efficiently. Thus, thanks to the weaker receiver’s cache and 
a simple joint cache-channel coding scheme, we can send 
additional data to stronger receivers without any extra cost, i.e.. 
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extra rate-constraints. This additional benefit of asymmetric 
cache memories was not observed in GD, because a separate 
source-channel coding scheme was used for the delivery phase. 

II. Problem Definition 

A. Message library and feasible receiver demands 

We have a transmitter, K receivers and a library with D 
messages Wi, ..., Wd- The d-th message in the library Wd 
is independent of all other messages and uniform orQ 

where Rd > 0 is its rate and n is the transmission blocklength. 
We represent a particular combination of receivers’ demands 
by a tuple d = (di,..., dx) G {1, ..., D}^. That is, d rep¬ 
resents the situation where receiver 1 demands (i.e., requests 
and downloads) message Wd^, receiver 2 demands Wd 2 , and 
so on. Let 

denote the feasible set of all possible receiver demands. The 
feasible set V is known to the transmitter and receivers during 
the caching and delivery phases, but the specihc demand tuple 
d chosen from V is only revealed for the delivery phase. 

B. Caching phase 

For each receiver k € {1,..., A"}, the size of its cache is 
described by a nonnegative integer Mk, see ([T]i below. The 
transmitter sends 


Zfe := gkiWi ,..., Wd), 

to receiver fc’s cache, where ; 

Zfc. such that 

log|Zfc|<2"^L (1) 

The caching phase occurs during a low congestion period, 
and we assume that Zfc is reliably conveyed to receiver fc’s 
cache (for each k G {1,..., AT}). 

C. Erasure Broadcast Channel Model 


For our purpose only these marginal probabilities are relevant. 

We discuss a caching system in the next section that is built 
on separate cache and channel codes, and, for this reason, it is 
useful to recall the degraded message set capacity region for Q. 
A channel-coding rate tuple R{ 2 ,...,k}, ■ ■ ■, R{k}) 

is said to be achievable on Q if the following holds; For any 
e > 0 there exists an encoder and AT-decoders such that, for 
all k, the transmitter can send {R{k,...,K} — ^) information bits 
per channel use to every receiver in the set {k,k + l,..., AT} 
with an average probability of error less than e. The set of all 
achievable rates — the degraded message set capacity region 
— is given by the next proposition. The proposition can 
be distilled from II2, and we omit these details. 

Proposition 1: 


= < {R{1,...,K},R{2,...,K}, ■■■, R{K}) '■ 


E 


R{k,...,K} 

F{l-5k) 


< 1; R{k,...,K} > 0: 



D. Delivery phase 

For each feasible demand d € D, let 

D 

/d; 

d' = l 

denote the corresponding encoder at the transmitter. Given d G 
V and the library (Wi ,... ^Wd), the transmitter sends 

-.= UWd...,Wd), (2) 

where X" = {Xi,..., X„). Receiver k observes Yjf = 

• ■ •, Yk,n) according to the memoryless law Q. Let 

Pk.A ■■ y" X Zfc ^ {0,1,..., - 1} (3) 

denote the decoder at receiver k. Given demands d gV, cache 
content Zfc and channel outputs Y^, receiver k outputs 

Wk := pkA^k.^k) 

as its reconstruction of the dfc-th message Wdt,- 


The delivery phase occurs during a high congestion period, 
which we model by an erasure BC with input alphabet X := 
{0,1}^. Here F > 0 is a fixed positive integer, and each 
X G X is w A-bit packet. Due to congestion, some packets 
may be lost when, for example, router buffers overload. We 
denote the event of a lost packet with the erasure symbol A, 
and the EC’s output alphabet hy y := X 0 {A} (the same 
alphabet is used for all receivers). Fix 

1 > (5i > ^2 A ■ ■ ■ A A 0- 

Let Q{yi,... ,yk\x) ■= P[(yi, ■ ■ •, Yk) = (yi, •. ■, yK)\X = x\ 

be any probability law for the memoryless BC with marginals 

{ 1-4 if yk= X 

5k ifyfc = A , Vfe. 

0 otherwise 

^To simplify notation and help elucidate our main ideas, we assume 
throughout the paper that 2 ^^^. is an integer. 


E. Achievable rate-memory tuples 


Let 



K 


U \J{Wk^WdA 

dexife=i 


denote the probability of error at any receiver for any feasible 
demand. We call the collection of all encoders and decoders. 


{gi,g 2 ,---,gK} and {fd,<Pi,d,<f2,d, ■ ■ ■ ,<PK,d}d&v’ 

an (n, Ai,..., Rd,Mi, ..., MA-code. 

We say that a rate-memory tuple (Ai,. .., Rd, Xii, ..., 
Alif) is achievable if for any e > 0 there exists a sufficiently 
large blocklength n and an (n, Ai,..., Rd,Mi, ■ • ■, Mk)- 
code with Pe < e. 

Definition 1: We dehne the capacity-memory region C to 
be the closure of the set of all achievable rate-memory tuples. 

The main problem of interest in this paper is to determine 
the capacity-memory region C for a given erasure BC Q. 
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III. Motivating Examples 

We now demonstrate the potential of unequal cache memo¬ 
ries and joint cache-channel coding with three examples. Fix 
if = 2; P = {1,..., D}^-, Rd = R for all d; and 

Si = 4/5 and ^2 = 1/5. (4) 


A. Coded caching with symmetric caches 
Suppose that M.i = M .2 = M., and 

a:= M/R&[Q,D/2]. (5) 

Split each message Wd in the library into three sub-messages, 

Wd = {wf\wf\w^^), 

of rates AAjD, M./D, and R — 2MID. 

Caching phase: Store the sub-messages 

(lEf..., and (hEf ^\ ..., 


in the caches of receiver 1 and 2’s respectively. 
Delivery phase: The transmitter sends 


© w; 


(cl) 

d2 ’ 


(6) 


as a common message to both receivers, where the addition 
is modulo ^ sends as a private message 

to receiver 1 and 1E^“* as a private message to receiver 2. 
Notice that receiver 1 can recover Wd-i from the common 
message and while receiver 2 can recover lEd 2 from 

the common message and VE^“^ We use a good channel code 
to communicate these messages over the BC. 

Achievable rate-memory tuples: Proposition [T] asserts that 
the common message (|6]l and can be decoded by both 
receivers and can be decoded by receiver 2 whenever 

p_ A4 p_ 2A4 

F{l-S,)^ F{1-S2) - ^ ’ 

On substituting (|4]i, the inequality (|7]i simplifies to 
4 6 A4 

n<.Fil-S,) + --. ( 8 , 

All rate-memory tuples {R,..., R, A4,..., A4), with R and 
M satisfying (|5]l and (|8]l, are achievable. 


B. Separate cache-channel coding and asymmetric caches 

Now suppose that we have asymmetric caches A4i = 2A4 
and A42 = 0 for some A4 satisfying (|5]l. The total cache mem¬ 
ory available at both receivers remains unchanged, only now 
the memory at receiver 2 has been reallocated to receiver 1. 

Split each message Wd into two sub-messages, 

Wd = {wf\wf) (9) 

with rates 2A4/0 and R — (2A4/Z1) respectively. 

Caching phase: Store 1 receiver I’s 

cache. 

Delivery phase: We use a good channel code for Proposi¬ 
tion [1] to reliably communicate the above sub-messages. The 
transmitter sends as a common message to both receivers 


(although it is only used by receiver 1), and it sends VE^^*' 
as a private message to receiver 2. 

Achievable rate-memory tuples: Proposition [T] asserts that 
reliable communication is possible if 


R-2M. 


R 


< 1 . 


F(l-Ji) F{1-52) 

On substituting (|4li, the inequality (fTOl i simplifies to 


( 10 ) 


4 SAT 


All rate-memory tuples (i ?,..., R, A4,..., A4), with R and 
A4 satisfying (|5]l and (fTTTi are achievable. 


C. Joint cache-channel coding and asymmetric caches 

As in Section llll-BI Let AA 2 = 0 and A4i = 2A4, for some 
A4 satisfying (|3, and split each message Wd into two sub¬ 
messages with rates 2M/D and R—{2M/D) respectively. 

Caching phase: Store (1E/‘^*\ ..., at receiver 1. 

Delivery phase: Transmission takes place in two phases 
using timesharing. First phase of length Pin, for some Pi G 
[0,1]: The transmitter sends 

as a common message to both receivers using a joint cache- 
channel code. Second phase of length (I — Pi)n” The trans¬ 
mitter sends 1E^“* to receiver 2 using a point-to-point channel 
code. Receiver 1 tries to decode 14/^“* and receiver 2 tries 
to decode ,W^^\wj^^). A key observation here is 

that is stored in receiver I’s cache. As we see in a 

moment, for a G {0,^}, this allows to freely piggyback 
receiver 2’s message VE^^'^ on receiver I’s message lE^/* 
without compromising the rate to receiver 1. 

Achievable rate-memory tuples: By Tuncel’s seminal broad¬ 
casting with side-information result ifTSll . communication in 
phase 1 (to both receivers) is very likely to be successful if 
the following two conditions hold: 

R-'^ <Fil-Si)pi (12a) 

R<Fil-S2)Pi; (12b) 


communication in phase 2 is very likely to be successful if 

R-‘^ <Fil-S 2 )il-Pi). (12c) 

Inequalities (fTSl i prove achievability of all rate-memory tuples 
{R,... ,R, A4 ,..., A4), with R and A4 satisfying Q and 


|F(1-^i) + 2M, if^e[o,2i2] 

2^1-^!) + # iff 


(13) 


D. Discussion 

Comparing the rate-memory tradeoffs in (O, (fTTl i and (fOl) . 
we see that it is advantageous to use unequal cache sizes 
and joint cache-channel coding. In particular, allowing larger 
caches at the weaker receivers (with higher packet erasure 
probabilities) both reduces the delivery-phase rates to the 
weaker receivers and increases rates to the stronger receivers. 
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IV. A Joint Cache-Channel Code eor 
Arbitrary Demands 

We now describe a joint cache-channel code that can be 
applied for any set of feasible demands T>, but we restrict 
attention to equal message rates 

Rd = R, d G {1,..., D}. 


We first treat the case where the Kq weakest receivers (re¬ 
ceivers 1 to Kq) have equal cache sizes and the remaining 
receivers do not have caches: 


Mk = 


M if k<Ko 
0 if k> Kq 


(14) 


We explain later in Section IIV-BI how the scheme can be 
generalised to setups with unequal cache sizes. 


A. Scheme for cache sizes satisfying Gil 

Preliminaries: Choose a positive integer t < Kq, and let 



Split each message Wd into (r -|- 1)-sub-messages, 


Wd = {W^^\. 


where 



FFWe{0,l,.. 

2nR{i) _ 

and 

1 

f M 

for i = 1, 2,..., T 

AW := \ 

1 ’ 

1 

1 Dt 

for i = T 1. 

Caching Phase: Consider the Kq weakest receivers 


Ai, 72.2, ■ 

. . , TZr 


denote the r different subsets of Kq] with size t. For 

each i = 1,2,... ,T, take the tuple 

and store it in the cache of each and every receiver in TZi. Here 
we have stored sub-messages in receiver fc’s cache 

(for each k G {1,2,..., Kq}) with a total memory requirement 

Delivery phase: The demand tuple d S is given, and we 
are required to communicate message Wd^ to receiver 1, Wd 2 
to receiver 2, and so on. 

We consider sets of (f 4-1)-receivers in {1,..., Kq}. Within 
these sets, each subset of t receivers shares a sub-message 
that is demanded (but unknown) by the remaining it + l)-th 
receiver. For each set of (f-|-1 (-receivers, we form the “XOR” 
of the (f -f 1) sub-messages having the two above mentioned 
properties, that is, being known at t of the receivers and 
demanded by the remaining {t -f l)-th receiver. For example. 


for the subset of receivers {1,..., f -|- 1}, we form the XOR 
message 

t+i 




(ik) 
dk ’ 


k=l 


where the addition is modulo (or, equivalently, 

a bitwise XOR operation); and for each k G {l,...,f-|-l}, 
ik i& such that 


^{l,...,k-l,k + l,...,t + l}. (15) 

Notice that (fTST i implies that is stored in the caches of 

receivers 1,... ,k — l,k + 1,... ,t, but not at receiver k. 

We use a time-sharing scheme to send the XOR messages 
as well as all other messages to be transmitted. The time¬ 
sharing comprises K phases. Each phase k G {1,.. . ,K} is 
constructed so that it can be decoded by Receivers k,k + 
1,... ,K. Phase k is of length /3fcn, where 

K 

= 0</3fc<l. (16) 

fc=i 


In phase k G [1,..., Kq}, we send 

• the XOR messages that are demanded by receiver k but 
not by receivers 1 to fc — 1; 

• the uncached message demanded by Receiver A:; 

• the first nC^, j. bits of sub-messages HQ. , for every k G 

{Kq + 1,... ,K} and every i G {1,...,t} such that 
k G TZi- These messages are all known to receiver k and 
therefore do not limit the decoding at receiver k. The 
rates are parameters of a scheme. As we shall 

see, when they are chosen sufficiently small, but positive, 
and < 6k, then sending these bits does not limit the 
decoding at receivers k+1, k+2 ,..., A". In fact, similarly 
to our motivating example, in this case, the transmitted 
bits of sub-messages can be freely piggybacked on 
the other messages transmitted in this phase k. 

In phase k G {Kq -|- 1,..., AT} we send: 

• the sub-messages of that have not been sent in any 
previous phase. 

Achievable rate-memory tuples: 

Proposition 2: A rate-memory tuple {R,... ,R,A4i = A4, 

..., Mko = 0,..., 0) is achievable if for some 

• positive integer f; 

• nonnegative A"-tuple (/Ji,... ,/3k) satisfying (fT6l i: and 

• nonnegative real numbers {C^, with k G {1,..., ATo} 
and k G {Kq -|- 1,..., AT}; 

the following (AT -|- AToj-conditions in G3 hold. 

1) For each k G {1,..., Kq — t — 1}, we have 


R<F{1- Sk) + 





(17a) 


and 

K 

i? + E ^kfk < ^(1 - 4+i) 

k—KQ-\-l 
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M (fKo\ [Ko-k 

(17b) 

2) For each k G {Kq — t,..., Kq}, we have 


and 

K 

r+Y. 

k—Ko-\-l 

3) Finally, for each k G {Kq + 1,..., A"}, we have 
Ko 

i?- ^ <F(l-4). (17e) 

fc'=i 

Proof outline: For each k G {l,...,Aro — t}, Condi¬ 
tion (I17ab ensures that receiver k can reliably decode the sub¬ 
messages sent during phase k, and Condition (I17bb ensures 
that all of the stronger receivers in {fc-|-l,...,Ar} can also 
reliably decode these sub-messages. Similarly, Condition (fTTa 
ensures that each receiver k G {Kq — t,..., Kq} can reliably 
decode the sub-messages sent in phase k, and Condition (I17db 
ensures that all of the stronger receivers in {fc-|-l,...,Ar} 
can also reliably decode these sub-messages. Finally, Condi¬ 
tion (fTTel) ensures that each receiver k G {ATq-I- 1,..., AT} can 
decode the sub-messages sent in phase k. ■ 

Discussion: The parameters describe the gain that 

our scheme achieves over separate cache-channel coding 
schemes. If some of these rates are strictly larger than 0, 
then our scheme strictly outperforms separate cache-channel 
coding. It is possible to choose them strictly positive whenever 
the erasure probabilities (5i,..., Skq are not all equal. 

We took advantage of the fact that receiver k has already 
cached the additional nCf. j. message bits that are sent in 
phase k. Some of these bits are also available to the next- 
stronger receivers fc-|-l,fc-|-2,.... For simplicity, we ignored 
this fact in our analysis, and it is likely that further gains can 
still be made. 

B. Scheme for unequal cache sizes 
Assume now that 

Mi>M2>---Mk>0. (18) 

Our scheme in the previous subsection is easily extended to 
this more general setup using time-sharing. Specifically: Let 
Pi,, Pk be real numbers in the interval [0,1] that sum up 
to 1. Over a fraction of time Pi, i G {1,..., A"}, we use our 
scheme in the previous subsection assuming that only the first 
= K + 1 — i receivers have caches of equal cache size 
= Pi — M-K-i+i)- (Set M.k +1 — 0.) 

V. Single Common Demand 
In this section we consider the optimistic case where all 
receivers demand the same message. This corresponds to 

R = {(dll ■ • ■ G {1,... ,D}^ ■. di = d2 = ■■■ = dic}- 

The rates Ri,... ,Ro can be arbitrary, i.e., do not have to be 
equal as in the previous section. 


R<F{1- 4) + (17c) 

+ (17d) 



A. Result 

Theorem 3: A rate-memory tuple {Ri,..., Ro,Mi ,. •., 
Ad if) is achievable if and only if, 

Rd< min [{l-5k)F + Mk,d). d G {1,..., O}, (19) 
for some nonnegative numbers {A4k,d} that satisfy 

D 

ke{l,...,K}. (20) 

d=l 

Proof: See the following two subsections. ■ 

We thus again wish to allocate small cache sizes to strong 
receivers and large cache sizes to weak receivers. 

If we used separate cache-channel codes. Constraint ( fTOb is 
replaced by 

max {Rd - Mk,d) < min (1 - 5k)F, (21) 

ke{i,...,K} fee{i,...,if} 

and the benefit of having unequal cache sizes {Ad^} at the 
different receivers disappears. 


B. Proof of achievability 

We propose the following scheme. 

Caching phase: Each receiver k stores in its cache the first 
nM.k,d bits of each Message Wd, for d G {1,..., L}, where 

N 

Y^Cd<Mk, (22) 

d=l 

in order to satisfy the cache-memory constraint. 

Delivery phase: Assume di = 4 = • ■ • = Ak = d*. Use an 
i.i.d. Bernoulli-1/2 point-to-point code to send Message Wd* 
to all receivers. Each receiver k knows the first nA4k,d* bits 
of this message, and thus during its decoding it can restrict 
attention to the part of the codebook that corresponds to these 
bits. Eor receiver k it is thus as if the transmitter had sent only 
its missing bits over the channel. 

Alternatively, a joint cache-channel code based on Tuncel’s 
virtual binning technique m can be used for the delivery 
phase. 

Achievable rate-memory tuples: By o, whenever 

Rd-Mk,d < il-6k)F, yd€{l,...,D}, k€ [I,..., K], 

(23) 

the probability of error can be made arbitrarily small as n —> 

c». 


C. Proof of Converse 

Eix a block length n, and define 


Mk,d = -I{Wd;Zk), 
n 


kG{l,...,K}, dG{l,...,D}. 

(24) 


For each A: € {1,, K}, 

N D 

y^Mk,d=y^-i{wM 

d—l d—1 
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= ^^{H{Wd)-H{Wd\Zk)) 

1 ^ 

= -{H{Wu-.-,WD)-Y.H{Wd\Tk)) 

d^l 

< -{H{Wu ...,Wd)- H{Wi,..., WN\Zk)) 

n 

= -I{Wi,...,WD-,Zk) 

n 

<-H{-lu)<Mk, (25) 

n 

where the second and fourth equalities follow by the definition 
of mutual information; the third equality because the mes¬ 
sages are independent; the first inequality because the sum of 
marginal entropies of a tuple of random variables, is at least 
as large as the joint entropy of this tuple. 

In the following, let denote any sequence that tends 
to 0 as n —oo. Fix an achievable rate-memory tuple 
(i?i,..., Rn, -Ml ,..., A4k)- Also for an arbitrary large n, 
let {Zi,..., Zx}, {fd}, and denote cache content, en¬ 

coding functions, and decoding functions achieving this rate- 
memory tuple. Fix now d* € {1,, D} and k € {1,..., K}, 
and let X" = fd*{Wi ,..., Wd) and y" denote inputs and 
outputs corresponding to demand d* = ... ,d*). We 

have 

Rd* < -HiWd*) 

n 

= -I{Wd*;Y,^,Zk) + -H{WM:^,Zk) 

n n 

< -I{Wd*;Y^\Zk) + -I{Wd*-,Zk) + en 

n n 

1 "" 

= -J2 {H{Yk,t\^k, Y*-^) - HiYk,t\Wd*,Y^-\Zk)) 

+Mk,d* + Cn 
1 " 

< — {H{Ykd) — H{Ykd\Xkd)) + Adfc.d* + En 

t=l 

1 ” 

= — liXk u Xk,t) + ^k,d* + Cn 

t=l 

^ (1 ~ 5k)F + ■M.k,d* + En, (26) 

where the second inequality follows by Fano’s inequality; the 
third inequality because conditioning cannot increase entropy 
and because of the Markov chain (Wd*,Y^~^Zk) Xt ^ 
Ykd', and the last inequality by the capacity of the erasure 
channel; all equalities follow by the definition and the chain 
rule of mutual information. 

Letting n —?► oo, and thus e„ —0, establishes the desired 
converse. 
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